SYSTEM AND METHOD OF AN EFFICIENT SNAPSHOT FOR SHARED 

STORAGE 



A rKr.ROITND OF TH F INVENTION 

Field of the Invention 

[0001] The present invention relates to a snapshot technique for a shared storage 
supporting large capacity, and more particularly, to a snapshot technique which supports an 
on-line backup for a large logical volume based on a storage area network (SAN). 
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Discussion of the R elated Art 

[0002] In recent years, as technical developments of Internet-based application such 
as an electronic commerce and the like are made and users of Internet are rapidly spreading, a 
quantity of data in service increases exponentially. For this reason, there are demands for 
15 large network storage which can effectively share a large quantity of information and provide 
service at high speed. 

[0003] As an example of technologies for implementing the large network storage, a 
network attached storage (NAS) and a storage area network (SAN) have been proposed. The 
SAN is a data file-oriented computer system environment which can directly access to a 
2 0 storage connected to a network, not via a server. 

[0004] In the meantime, enterprise systems that must support 24X7X365 
environment require data availability and reliability as well as high-speed processing of large- 
sized data. Among several methods of ensuring the reliability and availability, one method 
that can meet the requirements of these systems is an on-line backup whose importance has 
2 5 been stressed. A backup execution time increases exponentially in order for a backup of a 



large-sized data. Therefore, it is essential to provide an on-line snapshot based on a mapping 
table, since a system which stops its operation in order to execute the backup and then 
resumes its service is not useful. Here, the snapshot is a technique for storing and retaining 
data state at specific time when a user wants. The snapshot is a useful technique for the on- 
5 line backup and the like. 

[0005] The snapshot technique copies only data image, not entire data, and retains 
data obtained at the moment the snapshot is created. If data block is modified after the 
snapshot is created, a new block is allocated, and then, the data at the moment of the snapshot 
is copied. Thereafter, mapping entry values are changed so as to map a data block which is 

10 newly allocated. In other words, a copy-on-write (COW) operation is performed in order to 
retain the data obtained at the time of the snapshot creation. 

[0006] However, when the snapshot creation request is carried out in the on-line 
snapshot based on a conventional mapping table, a service cannot be processed because all 
hosts' access to an original volume is disconnected while copying the snapshot mapping table. 

15 As a size of the volume becomes larger, the mapping table increases. As a result, I/O access 
protection time also increases proportionally. 

[0007] In addition, the write operation of data block occurring after the COW 
operation requires many disk I/O operations, thereby degrading I/O performance of the 
volume. At a snapshot destruction operation, in order for the deallocation of the data block 

2 0 allocated by the COW operation, it is checked whether the data block of the original volume 
and the data block of snapshot volume are updated or not, and the newly allocated data block 
should be deallocated, thus increasing the snapshot destruction execution time. 

25 
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SUMMARY OF THE INVENTION 
[0008] The present invention is directed to a snapshot technique for shared storage 
that substantially obviates one or more problems due to limitations and disadvantages of the 
related art. 

5 [0009] An object of the present invention is to provide an improved snapshot method 

which supports an on-line backup for large logical volume based on a storage area network 
(SAN) providing a shared storage supporting large capacity. 

[0010] According to the present invention, performance of the write operation is 
improved by omitting a read operation to a snapshot mapping block which is required for a 
10 determination of COW in case of a write operation and a snapshot destruction operation by 
adding information such as FAB and SSB into a mapping entry. 

[0011] Additional advantages, objects, and features of the invention will be set forth 
in part in the description which follows and in part will become apparent to those having 
ordinary skill in the art upon examination of the following or may be learned from practice of 
15 the invention. The objectives and other advantages of the invention may be realized and 
attained by the structure particularly pointed out in the written description and claims hereof 
as well as the appended drawings. 

[0012] To achieve these objects and other advantages and in accordance with the 
purpose of the invention, as embodied and broadly described herein, there is provided a 
2 0 method of creating a snapshot for on-line backup in a network storage based on a storage area 
network (SAN), which comprises the steps of: changing an volume operation mode of all 
nodes, in which a mapping server exists, into a snapshot create mode; locking a mapping 
block by increasing a value of the mapping block by one; if the mapping block is not locked, 
increasing a value of a copy-completed block by one; unlocking the mapping block; and if the 
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copy of all the mapping blocks is completed, generating a volume configuration information 
for the snapshot at an original volume. 

[0013] According to another aspect of the present invention, it provides a method of 
destroying a snapshot for a shared storage supporting large capacity based on a storage area 
network (SAN), which comprises the steps of: changing an volume operation mode of all 
nodes, in which a mapping server exists, into a snapshot destroy mode; locking a mapping 
block by increasing a value of the mapping block by one; determine whether or not a copy- 
on-write (COW) operation is carried out to a data block, which is indicated by a mapping 
entry, by using a first allocation bit (FAB) and a snapshot status bit (SSB); if the COW 
operation is carried out, initializing the FAB and the SSB, and writing the modified mapping 
block onto a disk; unlocking the mapping block; and if an initialization to all the mapping 
blocks is completed, destroying a snapshot volume. 

[0014] It is to be understood that both the foregoing general description and the 
following detailed description of the present invention are exemplary and explanatory and are 
intended to provide further explanation of the invention as claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0015] The appending drawings, which are included to provide a further 
understanding of the invention and are incorporated in and constitute a part of this application, 
illustrate embodiment(s) of the invention and together with the description serve to explain 
the principle of the invention. In the drawings: 

[0016] Fig. 1 illustrates sequential operations of copy-on-write (COW) in a general 
snapshot technique; 

[0017] Fig. 2 is a view of a computer system configuration according to the present 
invention; 



[0018] Fig. 3 is a view of a hierarchical structure of logical volume configuring a 
shared storage according to the present invention; 

[0019] Fig. 4 illustrates a structure of mapping table and mapping entry according to 
the present invention; 

[0020] Fig. 5 is an operational flowchart illustrating a method of creating a snapshot 
for a shared large storage according to an embodiment of the present invention; 

[0021] Fig. 6 is an operational flowchart illustrating a method of destroying a 
snapshot for a shared large storage according to an embodiment of the present invention; and 

[0022] Fig. 7 is an operational flowchart illustrating a write operation for a shared 
large storage according to an embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
[0023] Reference will now be made in detail to the preferred embodiments of the 

present invention, examples of which are illustrated in the appending drawings. 

[0024] Fig. 1 illustrates sequential operations of copy-on-write (COW) in a general 

snapshot technique. 

[0025] Referring to Fig. 1, a reference numeral "1 10" represents a file system having 
the latest data, and a reference numeral "120" represents a snapshot area storing data which is 
necessary to retain the modified data block after the snapshot is created. A file A 1 1 1 and a 
file B 1 12 are stored in the file system 1 10. The file A 1 1 1 consists of blocks Al and A2, and 
the file B 1 12 consists of blocks Bl to B3. 

[0026] In such a structure, if a request for reading the first block of the file A 1 1 1 is 
sent from a application before the snapshot is created (151), the file A 111 is read out from 
the disk 1 10 to thereby provide a service. 



[0027] If a request for writing the first block of the file B 112 is occurred after the 
snapshot is created (154), the process proceeds to a step (140) of checking whether it is a first 
update or not. If the write operation to the first block of the file B 1 12 is the first update, the 
first block Bl of the file B 112 stored in the file system 110 is copied (Bl':112) to the 
snapshot area 120 and its information is recorded on the changed block map 121. Then, 
contents to be actually updated are recorded on the first block Bl of the file B 1 12 stored in 
the file system 110. If the write operation is not the first update, the contents are directly 
written on the first block Bl. 

[0028] In a read operation to the snapshot, it is necessary to perform a step (130) of 
checking whether the block is changed or not. As the check result, if not changed, data of file 
system 110 is directly read out. Meanwhile, if changed, a physical position 122 of the 
changed block is searched by carrying out an examination of the changed block map 121 of 
the snapshot area 120, thus reading out data. In other words, if it is a reading (153) of the first 
block Al of the file A, it is the unchanged block. Therefore, "Al" of the file system 110 is 
read out. If it is a reading (152) of the first block B 1 of file B, it is the changed block, so that 
"Bl"' of the snapshot area is read out. 

[0029] Fig. 2 is a view of a system configuration showing a physical environment for 
a logical volume manager (LVM) according to the present invention. 

[0030] The LVM is a shared storage cluster system providing a storage sharing via 
the SAN under a multi-host environment. As shown in Fig. 2, the SAN environment 200 
which is the basis of the LVM is generally provided with three parts. 

[0031] Referring to Fig. 2, the SAN environment 200 includes a network storage pool 
202 for storing and retaining data, a plurality of hosts 203 for sharing and accessing to the 
storage devices in order to provide service to users, and a plurality of switches 201 which is a 
storage area network for connecting the storage devices and the hosts 203 via a fibre channel 



(FC). Each of the hosts 203 uses a host bus adaptor (HBA) for connection with the switches 
201. Also, the hosts 203 and the switches 201 are connected through the fibre channel (FC). 
Each of the hosts is connected through a local access network (LAN) 204 in order to process 
control/management information. If such a physical environment is prepared, an environment 
capable of utilizing the LVM is provided. 

[0032] Fig. 3 is a view of a hierarchical structure of the mapping table based logical 
volume provided by the large logical volume manager, which is used in the computer system 
according to the present invention. The logical volume is generated from a storage pool 300 
which is provided to the hosts shared via the SAN. The Logical Volume 320 according to the 
present invention is abstracted into three parts. 

[0033] A first abstraction is a disk partition or a physical partition 310. The disk 
partition is generated by a tool which is generally provided from an operating system, and it is 
a minimum configuration unit. 

[0034] In other words, more than one disk partition 310 is gathered to form one 
logical volume. A size of the logical volume is changed based on the disk partition unit. The 
disk partition 310 of the volume is generally provided with a volume header area 311 and a 
data area 312 storing actual data. In addition, the volume header area 311 is constituted with 
a volume configuration information area 313, an allocation bitmap table 314, and a mapping 
table 315. 

[0035] A second abstraction is a physical volume 320. The physical volume 320 is 
named after a group of extendable disk partitions 310 and forms a continuous address space. 
A size of the physical volume 320 is changeable in a system operation, and it is a group of the 
disk partition 310 having the same volume configuration information 313. A type of the 
volume configuration information described by the user is generated at the physical partition 
310 configuring the physical volume 320. The physical volume 320 includes a volume 



identification (ID) which is a single identifier for distinguishing it from others within the 
storage pool 300 shared by all the hosts. 

[0036] A last abstraction is an extent 316. The extent 316 is a group of physically 
continuous blocks having the same size. In addition, the extent 316 is a minimum unit of the 
disk space which is allocable in order to store information. 

[0037] A size of the extent 316 is equal with respect to one logical volume and 
determined when creating the logical volume. Different logical volumes can have different 
extents 316 in size and their sizes should be an exponential series of 2 and a multiple of the 
size of hard sector, i.e., the minimum unit of the physical disk 310. 

[0038] After the disk partition 310 is generated by a tool provided in the operating 
system, the logical volume is defined with respect to several disk partitions. At this time, 
information necessary for the volume configuration is also provided. The information 
includes disk specific information such as capacity, # of extent, mapping table size, etc and 
volume information such as volume name, RAID information, and the like. 

[0039] The LVM creates and writes the meta data of the volume configuration onto 
the corresponding physical disk partition header 311 according to the user definition of the 
volume to be generated as above. If the recording of the configuration information in the 
volume header area 311 of all the disk partitions is completed, the physical volume 320 is 
generated and the generated physical volume 320 is registered and used in all the shared hosts. 

[0040] Fig. 4 illustrates a structure of a mapping entry 401 of the mapping table 315 
supporting the configuration and the mapping table scheme of the physical disk for the 
snapshot according to the present invention. 

[0041] The mapping performs a process of converting a logical address of the upper 
module into an actual physical address of the lower disk. In addition, an effective allocation 
and deallocation, and independence between the logical address 402 and the physical address 
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403 in the data block can be provided through the free space manager. A structure of each 
mapping entry 401 configuring the mapping table 315 is generally composed of three parts. 

[0042] Each of the mapping entry 401 includes a first allocation bit (FAB) 407, a 
snapshot status bit (SSB) 408, and a physical address 403 of the physical disk block. The 
5 physical address 403 consists of an address (DiskJD) 404 of the disk partition and an address 
(PhysicalJExtentJD) 405 of the physical block. 

[0043] When the data block is allocated actually by the free space manager, the 
physical address 403 of the mapping entry is changed to map the actual disk block. In the 
snapshot technique according to the present invention, the FAB 407 and the SSB 408 added to 
1 0 the mapping entry 401 is distinguished from a previous structure of the mapping entry. 

[0044] The FAB 407 is allocated to a first bit of all the mapping entries 401 and then 
used. The FAB 407 is a bit for distinguishing the data blocks which is First allocated and used 
after the snapshot is created. After the snapshot is created, if the data block is allocated by the 
free space manager and used, a value of the FAB is changed to "1" and the mapping entry 401 
15 is recorded on the disk. 

[0045] The SSB 408 is a bit representing a status of the snapshot. If the SSB 408 is 
"1", it means that the COW operation is already carried out after the snapshot is created. If 
the SSB 408 is "0", it means an initial value or that the COW is not yet carried out. Since the 
SSB 408 is maintained at each mapping entry and allocated at each snapshot by one bit, bits 
20 are allocated as many as the maximum number of the snapshot. Both the FAB 407 and the 
SSB 408 are initialized to "0" during a mapping table initialization process among the process 
of generating the physical volume on the disk of the shared storage. 

[0046] In general, in the conventional snapshot technique, both the original mapping 
entry and the snapshot mapping entry are read out and compared with the physical address in 
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order to determine whether or not the data change operation should carry out the COW 
operation to the block which is used for the first time after the snapshot is created. 

[0047] In other words, the disk I/O operation is carried out two times. In addition, the 
process of deallocating the COW execution block, which will be carried out when destroying 
the snapshot, a deallocation procedure of the data block should not be carried out to the block 
which is first allocated after the snapshot is created. By the determination, the general 
snapshot technique should carry out two times the disk I/O operation to the original mapping 
entry and the snapshot mapping entry. 

[0048] However, once the process of reading the original mapping entry is carried out, 
the snapshot technique according to the present invention through the FAB on the original 
mapping entry can determine whether or not the COW operation is executed. The FAB 407 
of the mapping entry is set to "1" with respect to the data block which is first allocated and 
used after the snapshot, and an operation of reflecting the mapping entry in the disk is carried 
out. 

[0049] If the FAB is "1" in the process of determining whether or not the COW 
operation is carried out, the contents of the corresponding data block are changed and 
reflected to the change on the disk. If the FAB is "0" among the already allocated block, the 
determination operation is achieved using the SSB. 

[0050] In the snapshot exists, the data block is classified into three cases according to 
their statuses. 

[0051] The first case is a data block which is first allocated/used after the snapshot is 
created. The second case is a data block which is used before the snapshot and not changed 
after the snapshot. In other words, the second case is a data block to which the COW 
operation has not been carried out. The third case is a data block which is changed after the 
snapshot, i.e., a data block to which the COW has been carried out. A status of the data block 
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which is changed after the creation of the snapshot is one of the three cases, and the COW is 
performed to only the second case if the change occurs. 

[0052] Like the first case, among the data blocks of the volume, there may be data 
blocks which are not used before the snapshot creation and first allocated after the snapshot 
creation. If a write operation to these data blocks occurs, a new data block is allocated and a 
write operation on the disk is carried out. 

[0053] However, the COW is not carried out since the blocks are not used before the 
snapshot. In other words, since the COW is not carried out, the SSB maintains an initial value 
"0" as it is. On the contrary, since the data block is in use from the second change and the 
SSB is "0", there is a problem of carrying out the COW operation. 

[0054] In other words, since the snapshot technique according to the present 
invention does not perform additionally the disk access operation in order to read the snapshot 
mapping entry, it is impossible to distinguish the block which is first used after the snapshot 
creation from the block which is allocated before the snapshot creation and to which the 
COW is not carried out. 

[0055] Accordingly, in addition to the SSB, an additional one bit (FAB) is provided 
to mark the data block which is first allocated/use after the snapshot is created. Further, the 
FAB is used to distinguish the block which is first used after the snapshot creation and the 
block which is allocated before the snapshot creation and to which the COW is not performed. 

[0056] In other words, in the case of the block which is not used before the snapshot 
creation and first allocated after the snapshot creation (i.e., in the case of FAB="1"), the 
COW operation is not performed even when the SSB is "0". 

[0057] 1. CREATION OF SNAPSHOT 

[0058] In order to perform the process of creating the snapshot, first, I/O and access 
to the original volume which is the object of the creation of the snapshot should be blocked. 



The original volume has to be frozen until the creation of the snapshot is completed. This 
freezing ensures that the data on the disk is in a consistent state. 

[0059] In the table-based mapping method, a mapping table for corresponding 
volume should exist in order to perform I/O operation to the volume. In other words, in order 
5 to perform the I/O operation to the snapshot volume, the mapping table for the snapshot 
volume should be created, and the mapping table creation operation to the snapshot volume is 
first carried out. 

[0060] The size of the mapping table increases in proportion to that of the volume. If 
the size of the mapping table becomes larger, a time required to generate the mapping table 
10 for the snapshot volume is also increased. In the shared storage based on SAN environment, 
the volume requires several TB to several thousands or more TB in size, and it takes several 
tens seconds to several minutes to generate the snapshot for such a volume. 

[0061] In other words, the hosts sharing and using all the volumes during that period 
stop their processing execution. However, the stopping of the execution for several tens 
15 seconds is not tolerable and thus the general snapshot creation method is not suitable for the 
shared storage based on SAN environment. 

[0062] The present invention employs an operation mode concept of the volume in 
order to minimize a delay of I/O operation to the volume when creating the snapshot. The 
operation mode of the volume is divided into three modes, i.e., a "normal" mode, a "snapshot 
2 0 create" mode and a "snapshot destroy" mode. Like the conventional snapshot technique, the 
snapshot creation method according to the present invention copies the original mapping table, 
and creates the snapshot mapping table. 

[0063] However, when copying the mapping table, this process is carried out 
simultaneously without blocking of the I/O operation to the original volume. At this time, the 
2 5 delay of the I/O operation occurs only while the operation mode of the volume registered in 



the host changes from the normal mode to the snapshot creation mode. This delay is only a 
very short time compared with a time taken to copy the mapping table. The time is so 
negligible that the general user cannot recognize it. 

[0064] Fig. 5 is a flowchart illustrating the method of creating the snapshot according 
to the present invention. The method of creating the snapshot according to the present 
invention will be described below in detail with reference to Fig. 5. 

[0065] First, a change of the configuration information on the original volume 
changed due to the creation of the snapshot is carried out (501). The information change in 
the number of the snapshot and the like is reflected in the registered volume. If the change of 
the configuration information is completed, the operation mode of the volume in a mapping 
server host changes from the normal mode to the snapshot creation mode until the snapshot 
creation is completed (502). In the snapshot creation mode, the access and I/O operation to 
the volume, which are performed by the general different processors, are carried out 
simultaneously while copying the mapping table. 

[0066] Then, in order to copy all the blocks of the original mapping table to the 
snapshot mapping table, a lock of an exclusive mode for the initial mapping block with 
respect to all blocks is acquired (503), and a copy operation of the mapping block is carried 
out (504). If the lock is not acquired, it is examined whether or not the copy operation to all 
the mapping blocks is completed (506). The case of not acquiring the lock is that other 
processor performing the write operation to the same block already acquires and carries out 
the locking operation. 

[0067] In this case, the copy operation to the corresponding mapping block is carried 
out by examining the operation mode of the volume in a write operation and carrying out the 
COW operation. If the lock is acquired, the original mapping block is copied to the snapshot 
mapping block (504) and the lock is released. (505). 
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[0068] Thereafter, it is examined whether or not the copy operation to all the mapping 
blocks of the original volume is completed (506). If not completed, the process returns to the 
step 503. If completed, a volume for the snapshot is allocated, the original volume 
configuration information is copied to the allocated volume, and information such as the 
5 snapshot name and snapshot sequence is reflected in the snapshot volume (507). If the 
creation of the snapshot volume is completed, the operation mode of the mapping server host 
is changed to the normal mode (508). By using the above manner, the creation of the 
snapshot is normally carried out without I/O interrupt of the application program accessing to 
the original volume. 
1 0 [0069] 2. SNAPSHOT DESTRUCTION 

[0070] Meanwhile, in the conventional snapshot technique, one factor causing 
degradation is an overhead accessing to the snapshot mapping block in order to determine the 
execution of the COW operation during the snapshot destruction operation. In the snapshot 
destruction operation, portion causing the overhead is a deallocation operation to the data 
1 5 block allocated by the COW operation. 

[0071] In other words, in the conventional snapshot technique, in order to determine 
whether or not the COW operation is carried out, both the mapping entry and the snapshot 
mapping entry are read out and it is examined whether or not block addresses indicated by the 
two entries are identical to each other. If the addresses of the two physical blocks are 
20 identical to each other, the COW operation is not carried out to the corresponding data block 
and thus only the address of the mapping entry is initialized. 

[0072] If not identical, the deallocation operation is carried out to the data block 
indicated by the snapshot mapping entry, and then the initialization of the mapping entry is 
carried out. If the number of the snapshot is more than one, mapping entries of other 



snapshots are also compared. In other words, an additional disk I/O operation should be 
carried out at least two times. 

[0073] For solving the problem of the conventional snapshot destruction operation, 
which is caused due to the access to the snapshot mapping block when determining whether 
5 or not the deallocation of the block is carried out, the present invention proposes a method of 
introducing the SSB 408 and the FAB 407 into the original mapping entry. The value of the 
SSB 408 is initialized to "0" when the mapping block is initialized in an operation of 
generating the physical volume, and it is changed into "1" when the COW operation is carried 
out. 

10 [0074] The snapshot technique according to the present invention can determine 

whether or not the COW operation is carried out only by reading out the mapping entry of the 
original volume without using the method of reading out the snapshot mapping entry and 
comparing it with the original mapping entry. In other words, as an examination result of the 
FAB 407, if the FAB 407 is "1", it is checked whether the snapshot to be destroyed is a first 

15 one or not. In case the FAB is "1" and the first snapshot, the deallocation is not carried out 
since the corresponding data block is a block which is first allocated and used after the 
snapshot is created. Except this case, the SSB value is examined. 

[0075] If the SSB 408 of the position of the destroying snapshot is "0", it represents 
the case that the COW operation is not carried out. If the SSB 408 is "1", it represents the 

2 0 case that the COW operation is carried out. If the SSB 408 is "0", a next block is processed. 
If the SSB 408 is "1", it is checked whether the next snapshot exists or not. If the next 
snapshot exists, the SSB value of the next snapshot is examined. 

[0076] If the next snapshot does not exist, or if the COW operation is carried out even 
though the next snapshot exists, the current data block is deallocated. In this case, when the 

2 5 next snapshot exists, the COW operation is determined using the SSB value, and whether to 



deallocate the current snapshot is determined. In other words, the snapshot technique 
according to the present invention can execute the snapshot destruction operation through the 
original mapping entry without reading out the value of the snapshot mapping entry. 

[0077] In case there are several snapshots, the conventional snapshot technique 
should carry out the I/O operation of reading out the mapping entry as many as the snapshots. 
This operation is carried out by comparing the mapping entry of the next snapshot, except the 
current snapshot entry. However, the snapshot technique according to the present invention 
can execute all the processes by reading out only the original mapping entry without regard to 
the number of the snapshots. Consequently, as the number of the snapshots are larger, the 
performance of the destruction operation is enhanced much more. 

[0078] Fig. 6 is a flowchart illustrating the method of destroying the snapshot 
according to the present invention. Hereinafter, the method of destroying the snapshot 
according to the present invention will be described below with reference to Fig. 6. 

[0079] In the snapshot destruction operation, the data block allocated by the COW 
operation should be deallocated. Then, the snapshot volume is destroyed after destroying the 
mapping table for the snapshot. 

[0080] If the snapshot destruction is requested, the volume operation mode of the 
mapping server is changed into the snapshot destroy mode (601). The change of the operation 
mode into the snapshot destroy mode is for the purpose of preventing the execution of the 
COW operation to the corresponding snapshot of the data block generated before the snapshot 
destruction operation is completed. The FAB 407 and the SSB 408 of the original mapping 
table entry are examined. In order to determine whether to deallocate the data block. 
Accordingly, the position of the disk block storing the original volume mapping entry is 
obtained (602) in order to read out the mapping entry for the access to the data block. The 
mapping entry is obtained by reading the mapping block from the disk block into the memory 
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(603). The operation of examining the execution of the COW is carried out to all the entries 
of the mapping block in order. 

[0081] Then, it is determined whether the COW operation is carried out using the 
FAB and SSB values of the mapping entry (604). If the COW for the data block which 
5 mapped by the mapping entry is not carried out, a procedure of examining the next mapping 
entry is carried out (609). If the COW for the data block has been carried out, a procedure of 
determining whether to deallocate the data block is carried out (606). The case of 
deallocating the data block of which the COW has been carried out is two. One is the case 
that the next snapshot does not exist, and the other is the case that the COW operation is 

10 carried out to the same data block after creating the snapshot when the next snapshot exists. 
In the above two cases, the data block should be deallocated (607), and the SSB value 
corresponding to the state bit value of the current snapshot of the mapping entry is initialized 
to "0" (608). In case that the data block is not deallocated, only the SSB value is initialized to 
"0". By doing so, the execution to one mapping entry is completed. 

15 [0082] It is examined whether the executions to all the entries existing in the mapping 

block are completed (609). If not completed, the process repeats the steps 605 to 608 with 
respect to the next mapping entry. If completed, the operation of reflecting the mapping 
block in the disk is carried out when more than one COW operation occurs (610). 

[0083] If the write operation to the mapping block is completed, the mapping block is 

20 unlocked (611). And then, it is examined whether the executions to all the mapping blocks 
are completed (612). If the mapping block to be executed exists, the process repeats the steps 
602 to 611. 

[0084] If the executions to all the mapping blocks are completed, an operation of 
destroying the snapshot volume from the host is carried out (613). If the snapshot volume is 
25 destroyed, an actual snapshot destruction execution is completed and an operation of 



reflecting the configuration information of the original volume to be changed due to the 
snapshot destruction operation is carried out (614). Finally, if the operation mode of the 
volume existing in the mapping server is changed into the normal mode (615), the snapshot 
destruction operation is completed. 

[0085] 3. DATA WRITE OPERATION 

[0086] The performance of the snapshot can be determined by evaluating how 
efficiently the write operation for the data block is performed when the data block is updated 
after the snapshot creation. The read operation of the snapshot mode is carried out in the 
same manner as that of the normal mode. In other words, the physical data block equal to the 
logical block is obtained through the mapping, and data is read out from the obtained physical 
data block. 

[0087] When the snapshot exists, the operation causing the degradation of the 
performance is a write operation of reflecting the change of the data block. In the snapshot 
technique based on the mapping table, the write operation is carried out in two cases. 

[0088] A first case is a data block which is not allocated/used before the creation of 
the snapshot and newly allocated/used after the creation of the snapshot. Since the snapshot 
volume maintains only the volume data image at the moment when the snapshot is created, it 
has no concern with the data used after the snapshot creation. 

[0089] Without additional processes in the same manner as the write operation of the 
normal mode, the data used after the snapshot creation is allocated from the free space 
manager, the physical address of the data block is reflected in the mapping entry of the 
original volume, and the write operation of writing the change of the contents to the disk 
block is carried out. 

[0090] A second case is a data block used before the snapshot creation and changed 
after the snapshot creation. The snapshot should maintain the volume data corresponding to 
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the moment of the creation as it is. Therefore, the contents of the data used before the 
creation of the snapshot should be maintained even when the contents are updated. The above 
operation carried out in order to maintain the contents of the data block allocated before the 
snapshot is the copy-on-write (COW) operation. 
5 [0091] The COW operation is an operation that should be carried out in the same 

manner in the snapshot technique based on the mapping table. A problem is the write 
operation for the data block allocated after the COW operation. The conventional snapshot 
technique determines whether the COW operation is carried out or not by reading out both the 
original mapping entry and the snapshot mapping entry and checking whether the addresses 
10 of the physical block mapped by the two entries are equal to each other. 

[0092] In other words, the I/O operation to the snapshot mapping block is 
additionally necessary. If the number of the snapshots increases, the number of disk I/O 
operation also increases in proportion to the number of the snapshots. For example, if the 
number of the snapshots is two, the I/O operation is carried out two times, and if three, the I/O 
15 operation is carried out three times. By doing so, the performance of the write operation is 
degraded in proportional to the number of the existing snapshots. 

[0093] The present invention solves the problem of the conventional write operation 
by using a following method. If the volume mode is the snapshot mode and the contents of 
the data block allocated/used before the snapshot is first changed after the snapshot creation, 
2 0 the value of the SSB is changed into "1". 

[0094] In other words, the value of the SSB corresponding to the snapshot of the 
mapping entry of which the COW operation is carried out is changed into "1". If the write 
request for the same data block which COW is already performed is occurred, the 
determination of the COW operation is processed using the SSB of the mapping entry of the 
2 5 original volume. The snapshot technique according to the present invention can achieve the 



operation through the original volume mapping entry without accessing the snapshot mapping 
entry on the disk and comparing it with the original mapping entry. 

[0095] As the number of the snapshots increases, the performance increases much 
more. The write operation carried out when the data block generated after the COW 
5 operation is changed has the same performance as the write operation of the normal mode. 

[0096] Fig. 7 is a flowchart illustrating the process of the I/O request according to the 
present invention, when the snapshot exists. If the I/O request to the volume occurs, the 
physical disk and the address of the mapping block storing the mapping information on the 
data block are obtained (701). 

10 [0097] An exclusive lock mode for the mapping block is acquired (702). Then, the 

mapping block is read out from the disk into the buffer of the memory, and the mapping entry 
corresponding to the logical address is obtained (703). The operation mode of the volume is 
examined (704). In other words, it is examined whether or not the current I/O is generated 
during the creation or destruction of the snapshot. 

15 [0098] If the operation mode of the volume is the normal mode NORMAL, it is 

examined whether the snapshot exists in the volume (705). If the snapshot does not exist in 
the volume, the data block is recorded in the disk like the general write operation (716). The 
lock for the mapping block is released (717). If the snapshot of the volume exists, it is 
examined whether or not the data block is used before the creation of the snapshot (707). The 

2 0 COW operation is not carried out to the data block allocated and used after the creation of the 
snapshot. 

[0099] The updated contents are written on the disk of the data block (716), and the 
lock for the mapping block is released (717). If the data block is allocated before the snapshot 
creation, it is examined whether or not the COW operation is already carried out after the 
2 5 snapshot creation (708). 



[00100] If the COW operation is already carried out, the data block is written on the 
disk (716), and the lock for the mapping block is released (717). If the COW operation is not 
yet carried out, the COW operation should be carried out. The snapshot mapping block 
corresponding to the same logical address as the original mapping block is read out into the 
5 buffer to thereby obtain the snapshot mapping entry (709), and a new physical data block is 
allocated in order to carry out the COW operation (710). Then, the contents of the data block 
are copied to the newly allocated data block, and the copied data block is written on the 
volume disk (711). 

[00101] The physical address mapped by the snapshot mapping entry is modified with 
10 the address of the newly allocated data block (712), and the value of the SSB with respect to 
the current snapshot of the original mapping entry is changed into "1" (713). The snapshot 
mapping block is recorded in the disk (714). Then, the original mapping block is recorded in 
the disk (715). After the steps 709 to 715 of carrying out the COW operation, the contents of 
the data block are recorded in the disk (716), and the lock for the mapping block is released, 
1 5 thereby ending the operation (717). 

[00102] If the operation mode of the volume is not the normal mode but the snapshot 
creation mode at the step 704, it is examined whether or not the copy operation is carried out 
to the mapping data block including the mapping entry (706). If the copy operation is 
completed, the value of the SSB is checked in order to determine whether the COW operation 
20 is carried out. If the value of the SSB is "1", it means that the COW operation is already 
carried out, so that the data block is recorded in the disk (716). Then, the process proceeds to 
the step 717. 

[00103] If the value of the SSB is "0", the COW operation is carried out. After the 
steps 709 to 715 of carrying out the COW operation, the data block is recorded in the disk 
2 5 (716), and the locking of the mapping block is unlocked, thereby ending the operation (717). 



If the copy operation is not completed, the COW operation should be carried out. The 
operation is ended after carrying out the steps 709 to 717. If the COW operation is carried out, 
the copy operation is automatically carried out to the snapshot mapping block. 

[00104] As described above, according to the efficient snapshot method for the shared 
large storage has advantages in that the application programs can be simultaneously executed 
during the operation of creating the snapshot in the shard storage supporting large capacity 
based on SAN environment. Further, the performance of the write operation occurring after 
the snapshot creation is enhanced, so that data availability and reliability are secured and the 
on-line backup is supported without the degradation of performance in an enterprise system 
requiring the high availability of 24X7X365, such as web server or electronic commerce. 

[00105] The above descriptions are for a kind of embodiment to implement a data 
transfer protocol control system and method with a host bus interface according to the present 
invention. The present invention is not bounded to the embodiment. It will be apparent to 
those skilled in the art that various modifications and variations can be made in the present 
invention. Thus, it is intended that the present invention covers the modifications and 
variations of this invention provided they come within the scope of the appended claims and 
their equivalents. 
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